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* NOTICES * 

JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2.**** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



DETAILED DESCRIPTION 



[Detailed Description of the Invention] 

Field of exception situation recovery invention of data processing system This invention relates to the 
exception situation method of recovery and equipment of data processing system. In especially this 
invention operating the data processing system controlled by the sequence of the instruction which 
forms a program (a) A step including the data transfer of the 1st type depending on the machine 
condition that performed said program while specifying the number of the checkpoints of said program, 
and said conversion was specified with the data transfer in said system, (b) The step which answers the 
exception situation produced during activation of this step (a), and interrupts said program execution, (c) 
It is related with the data-processing-system actuation approach equipped with the step which reruns 
said program from the latest checkpoint after solution of said interruption. 

Moreover, this invention is data processing system. A means to generate the sequence of the instruction 
which forms a program on the occasion of activation by this system, A means to identify the checkpoint 
in said program While performing said program, data transfer of the 1st type is performed. The 1st data 
path for which each of such data transfer depends on the machine condition that said system was 
specified, A means to answer the exception situation and to interrupt said program execution In data 
processing system equipped with a means to rerun a program from the recently checkpoint after solution 
of said exception situation Said system A means to connect with said data path and to record each of 
said 1st type of data transfer from a checkpoint recently [ said ] While interrupting said account data 
path of rerun Nakamae It is related with the data processing system characterized by having further a 
means to reproduce the recorded data transfer instead of actual data transfer synchronizing with said 
rerun. 

Such an approach and a system are known from 1469-1514 pages of "Checkpoint Repair for High- 
Performance Out-of-Order Execution Machines" by W.-M.W.Hwu and Y.N.Patt of IEEE Transactions 
on Computers published in December, 1987, Vol.C-36, and No. 12. 

Explanation of a related technique Data processing system is various "exception situations" produced 
while performing a program. 

It needs to be alike and it is necessary to harmonize and cope with it. The exception situation contains 
overflow and the page fault of interruption and count. (It was generated by the external asynchronous 
event) If actuation of these interruption is delayed, it can process by **, until a system reaches the 
condition which can save interruption after interruption is processed, and can be recovered of having 
been specified good. This is 68000. It is carried out with a microprocessor family and interruption is 
processed in this case only on a micro structure boundary. To it, it can be considered that overflow of 
count is serious to activation of a part of program, consequently the relation of it is lost with a machine 
condition. 

Other criteria of the exception situation are generated between the usual program executions, and control 
is given to hardware from software in this case. The main examples as which effective processing is 
required are the "page faults" of a virtual storage system, and become accessible [ an addressable 
predetermined part ] at the time of predetermined in an instant in this case. Other examples are TLB of a 
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system in case the trap and storage management unit (MMU) which emulate the op code which is not 
performed do not have table Wolk hardware. The processing in the special case in count of the Myst lap 
and the point floating [ IEEE ] is included. This type of exception situation is processed in an instant, no 
matter the condition of a machine may be what thing, and quick and the utility of the bulk memory 
which needs to process so that it may have dependability, namely, has a virtual addressing are lost. For 
example, MC68020 After processing the exception situation, it is necessary to make it recover, while 
carrying out the stack of the condition of the relation of a processor then and saving. Since a machine 
has the working decision nature, it is made for a machine to resume actuation to accuracy to all the 
practical objects by making the condition that it turns out that it is effective recover a machine, so that 
the exception situation may never occur. 

The processor of the higher engine performance is in the inclination which becomes denser pipeline 
arrangement. Since an additional hardware data path is required, the condition of such a machine 
becomes very large and most conditions exist in an expensive pipeline stage to access. Therefore, it 
comes to specify the "checkpoint" in the case of being special in a program actually. If it arrives at a 
checkpoint while performing, as long as it relates to future actuation, backup copy forms a machine 
condition. After processing the exception situation, it is necessary to restore the condition of a processor 
to the thing of the latest checkpoint, and to resume activation from a checkpoint. 
The exception situation recovery using a checkpoint is indicated by 1469-1514 pages of "Checkpoint 
Repair for High-Performance Out-of-Order Execution Machines" by W.-M.W.Hwu and Y.N.Patt of 
IEEE Transactions on Computers published in December, 1987, Vol.C-36, and No. 12. Hwu And Patt 
maintains the backup copy of a small register file, and explains the approach and equipment which 
restore a machine to the condition of a checkpoint. Backup copy is not practical to the mass machine 
condition which contains the full capacity of main memory in many cases. Therefore, Hwu And patt 
saves the value overwritten by the data transfer to the memory produced from a checkpoint recently, and 
in order to restore a condition before resuming activation from a checkpoint, it explains how to write in 
this value. Although the saved value occupies a very small tooth space, the rewrite of these values 
wastes time amount and has a possibility that the whole processing system performance may fall. 
A checkpoint retry system is the Europe patent specification 0348652nd. It is indicated by the number. It 
is held at a specific buffer until the instruction whose value written in cache memory specifies a 
checkpoint is completed, and a change of state is delayed in this system until it turns out by this that the 
exception situation has not arisen. 

The type of the processor for which it depends on parallel execution and dense pipeline arrangement 
substantially is a "very long instruction word" (VLIW) type processor. "Ideal VLIW" processor and 
TRACE A known practical use VLIW processor as a processor 1987 year 10 month 5-8 On a day Palo 
Alto of the California week () [ Palo ] Architectual Support of Alto for Programming Languages and 
Operating Systems Proceedings of (ASPLOS II) of the Second International 180-192 of Conference 
R.P.Colwell of a page etc. — it is indicated by the paper of the title of 'A VLIW Architecture for a Trace 
Scheduling Compiler' to twist, exception situation recovery - being related - Colwell etc. — once the 
exception situation is solved, the "history queue device" in which interrupted memory actuation is 
performed again will be explained vaguely. 

The video signal processor which has VLIW architecture is indicated by the above-mentioned Europe 
patent specification title No. (PHN 12.376) 325,310 which carries out relation. For the specific 
application of video signal processing, the small cyclic program which used the physical memory is 
used, consequently the problem of the exception situation which can be restored does not arise. The 
same configuration of a more common processor will be 3-5 in October, 1988. Computer Design of a 
day 'CREATE-LIFE by this invention person of Proc.IEEE International Conference : A Design System 
for High Performance VLSI circuits' and the Europe patent specification title 479,390 It is indicated by 
the number (PHA 1209). Since it is desirable to require mass virtual memory when using this structure 
for a general processing application, the problem of the exception situation arises. It is checked about 
loss of a circuit dimension and/or performance that the disadvantageous profit using a known recovery 
system becomes severe especially by the VLIW processor. 
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Europe patent specification 479,390 Other reference explaining the VLIW processor indicated by the 
number has not made reference in the approach of the exception situation recovery fitted to it 
(responding to a case). 

The approach these systems "cannot delete a control pipeline" easily, especially other problems relevant 
to a known exception situation recovery system decrease a circuit dimension by the other approach to 
predetermined performance level in a VLIW processor is needed. 

Epitome of invention The object of this invention is disadvantageous and/of performance. Or it is 
offering other exception situation methods of recovery which decrease a circuit dimension. The object of 
this invention is offering the suitable exception situation method of recovery using a dense pipeline 
processing system like a VLIW processor especially. 

In operating the data processing system controlled by the sequence of the instruction which forms a 
program, this invention is equipped with the above-mentioned step (a), (b), and (c), and said step (a) 
offers how said step (c) synchronized with said rerun instead of actual data transfer including playback 
of the recorded data recently [ said ] including record of said 1st type of each data transfer from a 
checkpoint. 

Instead of restoring the machine condition before the rerun in a checkpoint, the approach by this 
invention disregards the present machine condition, and constitutes the recorded value which is acquired 
from the exact machine condition before exception situation generating. Hwu And the small circuit 
dimension of the method of patt is maintained, and the inconvenience of performance including 
restoration of an exact machine condition is removed simultaneously. A known approach restores all the 
values overwritten from the checkpoint irrespective of whether it has the effect of arbitration on rerun. 
The approach by this invention memorizes and reproduces only the value which was used for original 
activation at accuracy, therefore is required of rerun to it. 

"programmer which includes the content of the pipeline stage in programming like a processor and 
which was limited dramatically ~ visible — " condition is specified. Under the present circumstances, a 
checkpoint specifies the point of the program relevant to actuation of the future [ condition / this / 
visible ]. a single string of each of the fragmentation which has arranged by dividing this into the 
fragmentation which carried out checkpoint assignment of the program, and carried out checkpoint 
assignment in this way is working it has only one pre DESS A which may exist. If it puts in another 
way, a control flow will be branched suitably in a checkpoint. A means in the specification which cannot 
avoid branching to be able to carry out the schedule of the program so that only a visible condition may 
relate to a junction (a pipeline is empty), namely, to satisfy an exact condition is established. 
Said step (c) may also include the restart of usual activation of said program while recording said 1st 
type of data transfer according to said step (a), if said rerun reaches the point that said activation was 
interrupted for said step (b). By this approach, the exception situation of other arbitration following the 
same checkpoint can be coped with automatically. 

When required, the step required of solving the exception situation is a known approach, and it depends 
for it on this contractor containing specific hardware and/or a specific software element at the property 
of the exception situation. These steps do not form a part of this invention. 

As a result of interruption of activation of said step (b), this transmitted control of said system to the 
exception situation manipulation routine from said program, was delayed in the transfer of said control, 
and the example which enables it to complete those activation explained it with the instruction generated 
before said exception situation. It can carry out as [ affect / the specified machine condition / forbid 
simultaneously the actuation started by the delay before a transfer of the control to an exception situation 
manipulation routine, and / these actuation ]. In case the usual exception situation resumes from the 
point of interruption, these means can be used if needed so that a machine condition may become exact. 
Actuation can be forbidden using the "guard" facility of a LIFE type VLIW processor, without using 
hardware especially. By using this facility (or what is equivalent to this), it says [ it / that this arises ] 
with the conventional exception situation recovery system and is satisfactory, and control of pipeline 
arrangement can also be deleted. 

This invention offers the suitable data processing system and the equipment for actuation according to 
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the approach by above-mentioned this invention. 

Easy explanation of a drawing The gestalt of operation of this invention is explained to a detail with 
reference to a drawing. 

Drawing 1 shows the linearity gestalt of the processor by this invention. 
Drawing 2 shows a format of the very long instruction word of the equipment of drawing 1 . 
Drawing 3 shows a part of imitation multiport memory of the equipment of drawing 1 . 
Drawing 4 shows actuation of the program fragmentation of the 1st example. 

Drawing 5 shows the scheduling of the program fragmentation of the 1st example to the equipment of 
drawing 1 including the problem of a collision. 

Drawing 6 shows some processors including exception situation detection and a recovery circuit. 

Drawing 7 shows record/regenerative circuit of a processor more to a detail. 

Drawing 8 shows the scheduling of the program fragmentation of the 2nd example. 

Drawing 9 shows actuation of the exception situation recovery circuit under 2nd-example activation of 

program fragmentation. 

Drawing 10 shows other examples of record/regenerative circuit of a processor to a detail. 
Scheme of a processor The processor of drawing 1 is equipped with the imitation multiport store circuit 
30 connected to five functional units which contain the branching unit BR in the constant unit CO, count 
and Logical unit AL, the register unit RG, and a data memory unit DM list. The specific register SP 
which has the storage unit 26, a multiplexer 27, and the multiplexer 29 of relation in a store circuit 30 is 
illustrated in linearity. The principle of such actuation of a store circuit and a processor is indicated by 
the Europe patent specification No. (PHA1209) 479390, SURABEMBAGU and a love loss could refer 
it, and both sides already explained it. These principles did not form a part of this invention, but since it 
was simple, those publications were omitted. 

Sequencer SEQ And instruction generating pipeline IIP The global controller 41 which it has is formed. 
Sequencer SEQ Instruction generating pipeline IIP Program counter WORD PC supplied is generated in 
each cycle. Instruction generating pipeline IIP Being able to consider as an OFU chip, this contains the 
program memory by which the very long instruction word which forms an executive program is 
encoded. Instruction-after delay of predetermined number of cycles generating pipeline IIP The very 
long instruction word IW memorized to the address PC of program memory is supplied to the 
instruction register 46 on chip. 

The instruction word IW of a register 46 is destination control field DEC. Functional units CO, AL, RG, 
and DM according to individual And two or more fields COC, ALC, RGC, DMC, and BRC which 
control BR, respectively It has another bit CP which marks a checkpoint, although it is an easy example 
for the explained pro SESSAHA emergency - the total of the bit of instruction word IW — 163 it is . 
Drawing 2 shows the field of instruction word IW more to a detail, and these fields have the number of 
bits of each field attached under the field. The object of these various fields is making it clear from the 
configuration of a processor, and detailed explanation of the following of operation. The object of the 
checkpoint marking bit CP is explaining to a detail below with reference to drawing 6 -10 of a drawing. 
The constant unit CO does not have the operant input section, and has the output section (CO.O) as a 
result of 32 bits linked to the input section of a store circuit 30. The constant unit CO is 32 bit field COC 
of instruction word IW ( drawing 2 ) actually. It has the direct continuation between the output sections 
of the conte stunt unit CO. Thereby, the constant unit CO can supply constant value to other units from 
within the limits of instruction word through a store circuit 30. 

Count and Logical unit AL have the guard value input section (AL.G) of 1 bit and the two 32-bit 
variable input sections (right and left) (AL.L and AL.R) linked to each memory unit in a store circuit 30. 
Unit AL has the 32-bit output section (AL.O) which made reverse connection in a store circuit 30, and is 
the 16 bit field ALC of instruction word IW. It is controlled. 

Drawing 3 shows a part of imitation multiport store circuit 30 relevant to count and Logical unit AL to a 
detail. In a store circuit 30, they are operand AL.G and AL.L about the storage units (RAM) 48, 50, and 
52 according to individual. And it is made to relate to AL.R, respectively. 
Furthermore, if it contrasts with other usual processors, the write-in enable signal according to 



http://www4.ipdl.ncipi.go.jp/cgi-bin/tran_web_cgi_ejje 



1/11/2006 



JP,09-509515,A [DETAILED DESCRIPTION] 



Page 5 of 16 



individual will not be supplied to the storage unit in the equipment of drawing 1 -9. Instead, in case it 
does not ask for at least one writing of as opposed to [ exist namely ] the storage unit for the storage 
location in which at least one writing is impossible, it can address. If it assumes that it is already known 
when the value of a write-in enabling bit is scheduling, in case a result will be assigned to the location of 
a specific storage unit, the width of face of instruction word IW is decreased when these enabling bit 
does not exist. 

The storage unit 48 to guard bit AL.G has eight 1 bit positions including an explained location in which 
one writing is impossible for memorizing a Boolean value (TRUE/FALSE bit). The triplet read-out 
address to the storage unit 48 is control field ALC of instruction word IW by the instruction register 46. 
It is supplied from 3 bit-field AL.G.RA ( drawing 2 ), and the 1-bit read-out port 54 of a storage unit 
supplies guard bit AL.G to Unit AL from the addressed storage location. 

A multiplexer 56 has two or more 1-bit input sections linked to the 32-bit minimum important bit (lsb) 
connected to the output sections (CO.I, AL.O, etc.) as a result of various functional units. About the 
input section of an addition of a multiplexer, it is lsb of the additional 32-bit bus line 58. It connects, and 
this conveys output SP.O from the delay element of the gestalt of the specific register SP, as a result of 
being delayed. When it is used preparing the delay element which solves the collision of this type of 
VLIW, it is the above-mentioned Europe patent specification 479,390th. The technical problem of a 
number (PHA1209) is formed. It is lsb to a guard bit. When it is used, it comes to have compatibility 
with the predetermined programming convention which includes programming of C, for example. In 
other situations, the maximum important bit (msb) or other bits can be chosen more as convenience. 
A multiplexer 56 has the 1-bit output section 60 linked to the write-in port of the storage unit 48. A 
multiplexer 56 is destination control field DEC of instruction word IW. Selection field AL.G.SE of a 
triplet It is controlled. The write-in address of the triplet to the write-in port of the storage unit 48 is 
destination control field DEC. Adjoining 3 bit-field AL.G.WA It is given. 

Similarly, the 32-bit storage units 50 (as opposed to AL.L) and (as opposed to AL.R) 52 have the 32-bit 
multiplexers 62 and 64 of relation, respectively. 

Each has the location including the location in which one writing is impossible of 16 pieces. Destination 
control field DEC Two triplet selection field AL.L.SE which controls multiplexers 62 and 64, 
respectively And AL.R.SE Two 4 bit-field AL.L.WA(s) which supply a 4-bit write-in port address to 
each of the storage units 50 and 52 And AL.R.WA It contains. Control field ALC Two 4 bit-field 
AL.L.RA And AL.R.RA The read-out address to the read-out port of the storage units 50 (as opposed to 
AL.L) and (as opposed to AL.R) 52 is generated. 

op code AL.OP to count and Logical unit AL Control field ALC to Unit AL It is given by five bit fields. 
It sets in each cycle and is op code AL.OP. In order to generate result AL.O, it is determined [ which ] to 
argument AL.L and AL.R whether activation (addition, subtraction, a comparison, AND, OR, etc.) of 
operation should be carried out. 

If it returns to drawing 1 , the register unit RG will offer the storage on chip to 16 program variables 
whose each is 32 bits and which are used frequently. Unit RG has the storage unit of the relation in the 
store circuit 30 to guard bit RG.G and 32-bit data input RG.I. These storage unit has only four locations 
where each includes the location in which one writing is impossible. Data output (RG.O) is connected to 
the input section of the store circuit 30 which forms each bus line (refer to RG.O and drawing 3 ). 
Destination control field DEC Selection field RG.G.SE of two triplets which control the multiplexer of 
relation And RG.I.SE Two 2-bit write-in address-field RG.G.WA(s) chosen among four locations of 
each storage unit And RG.I.WA It contains. 

9-bit register unit control field RGB Inside, they are two 2 bit-address RG.G.RA. And RG.I.RA It 
prepares in order to choose from each storage unit the location of the value of value RG.G which should 
be carried out reading appearance, and RG.I. 5-bit op code field RG.OP It has 1 bit chosen between 
read-out actuation of REJITA, and write-in actuation of a register, and 4 bits chosen among 16 registers 
in Unit RG. Therefore, the location of the variable in the register unit RG is fixed by the compiler. 
The data storage unit DM includes a lot of storage to a variable at least notionally. Actually, Unit DM is 
equipped with the interface over a bank of read/write memory on chip so that it may explain to a detail 
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later with reference to drawing 6 . In the imitation multiport store circuit 30, the storage unit according 
to individual is prepared to guard bit DM.G and 32-bit address word DM.A and 32-bit close KADETA 
WORD DM.I. These storage unit has the location including the location which cannot write in each of 
256 pieces. The data storage unit DM has the 32-bit data output (DM.O) connected to the input section 
of the storage unit 30 which forms each bus line (also see DM.O and drawing 3 ). Destination control 
field DEC Three triplet selection field DM.G.SE and DM.A.SE which control the multiplexer in a store 
circuit 30 And DM.I.SE Each 256 of the storage unit of relation Three 8-bit write-in address-field 
DM.G.WA(s) and DM.A.WA which are chosen between the locations of an individual And DM.I.WA It 
contains. 

25-bit data storage unit control field DMC Inside, they are three 8-bit read-out address DM.G.RA and 
DM.A.RA. And DM.I.RA Value DM.G which should be carried out reading appearance from each of 
those storage units, and DM.A And it prepares in order to choose the location of DM.I. 1-bit op code 
field DM.OP 1 bit chosen between read-out actuation of storage and write-in actuation of storage is 
included. Address word DM.A specifies the address to the writing of data word DM.I, or read-out of 
data word DM.O. Thereby, dynamic count of 32-bit sufficient address to address the location of 232 can 
be performed among program execution. 

Sequencer SEQ which should be corrected so that it may branch during program execution with the 
branching unit BR The sequence of generated instruction address PC is permissible. Unit BR has the 
storage unit of the relation to guard bit BR.G and 32-bit destination address BR.A in a store circuit 30. 
These storage unit has four locations including the location which cannot write in each. Destination 
control field DEC Two triplet selection field BR.G.SE which controls the multiplexer of relation And 
BR.A.SE Two 2 bit write-in address-field BR.G.WA(s) chosen among four locations of each storage 
unit And BR.A.WA It contains. 

4-bit branching unit control field BRC Inside, they are two 2-bit read-out address BR.G.RA. And 
BR.A.RA It prepares in order to choose from each storage unit the location of value BR.G which should 
be carried out reading appearance, and BR.A. The branching unit BR is Sequencer SEQ about guard bit 
BR.G and destination address BR.A. It is made to pass simply, op code field BR.OR It is not required. 
Sequencer SEQ The Boolean value of guard bit BR.G is tested inside. When BR.G is truth, branching 
should be performed, and it is Sequencer SEQ. The next value of PC equal to destination address BR. A 
which received from the store circuit 30 is created. It is Sequencer SEQ when BR.G is a false. The 
increment of the value of PC is only carried out to PC+1, and the next instruction of the memorized 
program is addressed. 

Therefore, what (the truth or the false of lsb) logic result AL.O is taught to the storage unit to BR.G for 
can perform facultative branching. (Or it is always carried out, it is never carried out) A fixed result 
teach [ branching which is not facultative / a storage unit / as opposed to BR.G for CO.O ], i.e., when a 
storage unit always arranges truth or the location which always becomes false and which cannot be 
written in, it can perform. Since at least one location of each storage unit always cannot write in the 
addition of this latter as already explained, it is convenient especially in the gestalt of operation of this 
invention. (Branching is never performed) Guard bit BR.G=FALSE is often required, when branching 
actuation should be started in a predetermined cycle. 

Sequencer SEQ Bus line (refer to PPC and drawing 3 ) False program enumerated data PPC memorized 
by the input section of the store circuit 30 to form It is made to generate. PPC other functional units 
containing the branching unit BR — using it - a program - counting - correlation addressing can be 
performed. For this reason, it is the value PPC of a bus line to a storage unit about read-out delay. The 
number of cycles and PPC of an instruction to read It can specify as a difference between the numbers of 
cycles of an instruction which have PC equal to a value. 

Cycle i and instruction word IW (i+b) which branching delay of working and b cycle makes generate 
branch instruction 1st cycle i+b performed by branch instruction It generates in between. The change-of- 
state actuation which can teach the same Boolean value of the conditions of a decision instruction to 
guard bit input section AL.G, RG.G, DM.G, and BR.G, and includes another branching and by which 
the schedule was carried out to the various functional units under branching delay can be used for the 



http://www4.ipdl.ncipi.go.jp/cgi-bin/tran_web_cgi_ejje 



1/11/2006 



JP,09-509515,A [DETAILED DESCRIPTION] 



Page 7 of 16 



mediation cycle forbidden facultative. By getting to know the possibility of branching, within branching 
delay of the facultative decision instruction which may happen, the schedule of the effective actuation 
can be carried out with a compiler, and in case branch condition suits, when simultaneously few, an 
exact condition is satisfied. 

In order to start the actuation which is not guarded with the most sufficient convenience, suitable read- 
out address .R.A is answered in the location which cannot write in each guard bit storage unit, and it 
arranges so that the read-out port may be made to generate truth. It can also arrange so that the value of 
which the location which cannot write in a 32-bit operand storage unit is required frequently [ of a 
certain known ], for example, zero, may be generated. 

Only the register of off chip memory and the register unit RG and the content of the program counter PC 
are considered to be the parts of the programmer visible condition of the processor of this gestalt. The 
IMITETEYON multiport store circuit 30 can be strictly used with the compiler as a buffer to a 
temporary medium variable. Consequently, a functional unit like the constant unit CO is not needed for 
a guard. Even if count and Logical unit AL itself cannot change the programmer visible condition of 
equipment, the guard who avoids an exception situation like overflow of the count produced in false to 
Unit AL is prepared. 

In addition to branching delay of the branching unit BR, a scheduling compiler needs to have the so- 
called information of "latency" of each functional unit. While generating the number and the results 
(AL.O etc.) of actuation of the cycle which starts actuation for the latency of a functional unit, it is 
destination control field DEC. It specifies as a difference between the number of cycles which teaches 
the result to the location of one or more requests of the imitation multiport store circuit 30. About the 
already explained branching delay, the branching unit BR is the latency and Sequencer SEQ of a cause. 
And off-chip instruction generating pipeline IIP The sum with the latency of a cause can be considered. 
Example of scheduling For the following examples, the latency to a functional unit is assumed as 
follows. 

Unit CO-0 cycle latency Unit AL-1 cycle latency Unit RG-1 cycle latency Unit DM-two-cycle latency 
Unit BR-5 cycle branching delay The following list 1 is equipped with the fragmentation of a high-level 
language source code. Variables p and q show two records in memory. The objects of a program are two 
things to copy to the field of corresponding p record from the field of q record of two values of 32 bits. 
List 1 p A . field 1 :=q A . field 1 ; 
p A . field 2:=q A . field 2; 

Drawing 4 illustrates the configuration which may have data which can be used for executing a high- 
level instruction of a list 1. High-speed register file REG which has the location of 16 pieces which 
attached the number of 0 to 1 5 for storage of a program variable And (off chip) bigger bank MEM than 
that of read/write memory It carries out. About the local base address LB, it is a register file REG. A 
location 0 is made to memorize. Address LB is a memory bank MEM. Initiation of the table of an 
adjustable value is shown and two record pointers p and q are location LB+12 in this case. And it is 
recorded on LB+8, respectively. About each variables p and q, it is a memory bank MEM. It considers 
as the pointer to initiation of corresponding record arranged except. Offset of zero is made to memorize 
the 1st field of each record from the address to which it was specified by the pointer variable of relation. 
Offset of 4 is made to memorize the 2nd field of each record from the pointer variable of relation, 
therefore, the value q A . field 2 — the address - q+4 it is — it can read from a location. 
A list 2 shows the pseudo code fragmentation which attached Tl which makes it generate with the 
compiler which executes a high-level instruction of a list 1 by the VLIW processor, when the data 
configuration explained with reference to drawing 4 is assumed. Each line of a list 2 is single basic 
actuation OpOOl -Op014 which should be carried out a schedule to activation by the suitable functional 
unit. It specifies. The comment (*, --, *) of the end of each line makes reference in the types (AL, RG, 
CO, etc.) of the functional unit demanded first, and explains the result (output) of the actuation after 
that. 

(for example) Actuation Op003 Reference number 001 And 002 Op003 The receiving operand is the 
result of OpOOl, and Op002, respectively. It expresses that it is a result. Therefore, it is actuation OpOOl 
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about the effect of actuation Op003. A result and actuation Op002 It adds to a result and is actuation 

Op009 simultaneously. By effect, it is Op008. It is a result Op004 of off chip data memory It should 

write in the location addressed by the result, the list 2 which will perform the function specified by the 

list 1 if explanation of above-mentioned drawing 4 is combined with this information — each — of 

operation OpOOl -Op014 The object becomes obvious. 

List 2Tl:Op001 rgreg (0); (* RG and local base address *) 

Op002 constant (12); (offset value * to * CO and p) 

Op003 aplus 001, 002; (address [ of * AL and p ] *) 

Op004 read 003; (* DM,p *) 

Op005 constant (8) (offset value * to * CO and q) 

Op006 aplus 001, 005; (address [ of * AL and q ] *) 

Op007 read 006; (* DM,q *) 

Op008 read 007; (* DM, q A . field 1 *) 

Op009 write 004 008; (* DM, p A . field l:=q A . field 1 *) 

OpOlO constant (4); (offset * to * CO and the field 2) 

OpOl 1 aplus 004, 010; (address [ of * AL and the p A . field 2 ] *) 

Op012 aplus 007, 010; (address [ of * AL and the q A . field 2 ] *) 

Op013 read 012; (* DM, q A . field 2 *) 

Op014 write 011, 013; (* DM, p A . field 2:=q A . field 2 *) 

GOTO EXIT The sequential list of the pseudo code fragmentation Tl is carried out, and, in a suitable 
case for at least one functional unit to perform each actuation, it turns out that this fragmentation is not 
specified as the specific gestalt of the arbitration of a VLIW processor. Once a compiler once reaches 
collection of such fragmentation that constitutes an application program and a actual equipment gestalt 
serves as known, it is the task of a scheduler to map each actuation of fragmentation in the specific 
functional unit of the equipment of a specific cycle. 

A scheduler operates by constraint of the lot specified by the configuration of a specific VLIW 
processor. These constraint mainly contains the number of functional units and type which can be used 
by the target VLIW processor, and the latency of each functional unit. Other constraint contains the 
number of the locations of each storage unit of an imitation multiport memory. 

These locations are dynamically assigned if needed by the scheduler. For the problem of easy scheduling 
expressed by fragmentation Tl, the number of the locations of each storage unit of the VLIW processor 
of drawing 1 -4 appears in extent which does not have to consider a limit of the capacity of a storage 
unit as constraint of this description enough, and a certain thing is understood. 

Since it is not necessary to carry out a schedule until it makes the latency of the actuation of everything 
but all by actuation depending on the result of other actuation complete, the latency of each actuation is 
important. Therefore, actuation Op009 They are a two cycle and Op008 at least after Op (DM latency is 
2)004. It is necessary to carry out a two-cycle schedule at least later. Similarly, it is actuation OpOl 1. 
Although a schedule can be carried out in the same cycle as Op (CO latency is zero)010, it is necessary 
to consider as the two-cycle back at least than Op (DM latency is 2)004. This assumes that the writing 
and read-out of a storage unit location can be performed simultaneously. When that is not right, there is 
a possibility of needing another latency of 1 cycle for actuation of each type. 
Drawing 5 shows three different schedules (I, II, and III) which each maps in the specific VLIW 
processor which explained the pseudo code fragmentation Tl with reference to drawing 1 -4. Each line 
of each table expresses one very long instruction word IW, therefore one machine cycle. The cycle 
number which attached Cy to the head is given to the leftmost column. The following five columns are 
five control fields COC, ALC, RGC, and DMC to five functional units (CO of drawing 1 etc.). And 
BRC It expresses. Each control field start actuation with each functional unit. Therefore, at Schedule I, it 
is Field COC in a cycle 0. It orders to a constant unit (CO of drawing 1 ) to start Op005, consequently 
appears in the output section of the unit CO whose result CO.O=8 are the same (CO latency is zero) 
cycle. Similarly, it is Field RGC in a cycle 0 about the read-out actuation OpOOl of a register. It starts. 
The value LB of a local base address follows and is RG.O in a cycle (RG latency is 1) 1. 
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Destination control field DEC under each instruction Selection field DEC. .-.SE It expresses to the 
right of functional unit control field. These fields are various of operation OpOOl -Op(s)014. A result is 
taught to each storage unit so that such a result can be used. 

A scheduler is specified destination control field DEC. The latency of various functional units is taken 
into consideration, therefore, the field RGC of a cycle 0 from started register read-out actuation 
OpOOl a result — RG.O — the next instruction and field DEC.AL.L.SE of a cycle 1 It is taught to count 
and the left-hand side operand storage unit of Logical unit AL. This is expressed with the code RG of 
the AL.L train of the cycle 1 of Schedule L the same ~ the memory read-out actuation Op007 — the field 
DMC of a cycle 2 from — starting — simultaneous — the result of the actuation — DM.0 — a cycle (DM 
latency is 2) 4 - field DEC.DM.A.SE It teaches the address storage unit of the data storage unit DM by 
setting up 'DM'. 

A scheduler is the actuation Nw015 of itself. And Nw016 It adds and the jump by the following 
fragmentation is performed ('GOTO EXIT 1 of a list 2). Nw015 The constant unit CO is used in a cycle 1, 
and it is selection field DEC.BR.A.SE in a cycle 1. By setting it as 'CO' prescribes the target address to 
branching taught to the address storage unit of the branching unit BR. Nw016 It is the branching-control 
field BRC in a cycle 1. It uses and unconditional (BR.G=1) branching to the target address is started. A 
transfer of the control to the target address stops arising to a cycle 8 after fragmentation Tl completion 
by long branching delay of 5 cycles. If it puts in another way, branching actuation is shown into a 
schedule and can carry out the schedule of the effective actuation to the activation under branching 
delay. The schedule only of the abbreviation 1/3 of actuation of the maximum number which may 
happen is actually carried out to the schedule I of drawing 5 . It is because this cause is the example of 
the fragmentation of very small size, and since extent of coincidence becomes high as a flag mate 
becomes long, if the schedule of the longer fragmentation is carried out, it can be understood to this 
contractor that the schedule of the effective high actuation of a rate can be carried out. 
It is conquered with the gestalt of the operation explained to longer fragmentation by utilization of guard 
bit .G of a Boolean value which can use that facultative branching actuation of many propers can 
arrange most facultative branching so that it may have a branching probability close to 1 or 0. For 
example, branching it can be predicted that produces the loop formation which should be executed 100 
times in the probability of 0.99 or 0.01 is meant. 

The schedule I of drawing 5 is performed by the VLIW machine which has a true multiport memory, 
and it is destination control field DEC. It ignored thoroughly. However, Schedule I is of operation 
OpOOl -Op014 to the processor of drawing 1 -4, including [ therefore ] the collision of access to the 
storage unit of an imitation multiport memory. It is not an effective map. Especially, in the cycle 5 of 
Schedule I, it is Op004. Result DM.0 is sent to count, the left-hand side operand storage unit (AL.L) of 
Logical unit AL, and the address storage unit (DM. A) of the data storage unit DM. It sets in the same 
cycle 4 and is Op012. Result AL.O is also sent to the address storage unit (DM. A) of the data storage 
unit DM. It sets in the cycle 5 of Schedule I, and this collision is field DEC.DM.A.SE. It is marked by 
Code XX. 

When a delay element (the specific register SP) does not exist, a scheduler is forced so that the re- 
schedule of these actuation may be carried out, in case a collision is discovered, until constraint of 
processor hardware is satisfied. In this example, this serves as the schedule II of drawing 5 . 
It sets on Schedule II and is Op004. 1 cycle delay is carried out and it starts in a cycle 4. However, 
actuation Op009 and OpOl 1 And Op014 Op004 Depending on a result therefore, at least one cycle also 
of these actuation is also delayed. It sets on Schedule I and these actuation is other actuation Op012, 
Op008, and Op013. And Op014 It is necessary to carry out the re-schedule also of these actuation by 
overlap and hardware constraint mutually. 

Although the process of this re-schedule, i.e., "back track actuation", can be made to complete so that 
hardware constraint may always be satisfied, nine cycles of fragmentation Tl by which the re-schedule 
was carried out are occupied instead of 8 in this case. This means taking this long part of an application 
program compared with the case of the ideal VLIW machine which has the same fiinctional unit. 
Furthermore, back track actuation means the compiler including solution of the new collision generated 
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in case the first collision is solved in an additional work piece. It becomes serious especially by the big 
machine by which a collision almost generates the problem of the effect of the increasing compiler in a 
cycle at each time. It is clear that existence of a collision is generally undetectable until the schedule of 
two or more instructions is carried out exceeding the cycle which the collision generated. By 10 or the 
big machine of the functional unit beyond it, this requires the re-scheduling of dozens of times or 
hundreds of actuation to which the new collision produced as a frequent thing during re-scheduling may 
take place. 

The schedule III of drawing 5 is the simultaneous continuation application 594,534. The analysis of the 
example of the collision without back track actuation using the delay element of the gestalt of the 
specific register SP which was explained in detail by the number is shown. Destination selection field 
DEC Selection field DEC.SP.SE It is shown in the right-hand side of Schedule III. Of operation OpOOl - 
Op014 And Nw015 -Nw016 It turns out that it is almost the same as that of the thing of Schedule I 
(precise multiport memory). However, the collision of the cycle 5 of a between is Op004 two results 
which goes to a DM.A storage unit in Schedule III. Teach result DM.0 to a specific register, i.e., field 
DEC.SP.SE. Destination control field DEC By setting it as 'DM', it is solvable. Field DEC.DM.A.SE In 
this case, it is set as 'SP' in a cycle 6, and is Op009 of a cycle 6. It is Op004 when reading appearance 
should be carried out with the data storage unit DM. The result of having been delayed is taught to a 
DM.A storage unit. 

Op004 The result of having been delayed collides with the result of OpOl 1 in a cycle 6 in this case, and, 
as for these results, both sides go to a DM.A storage unit. Therefore, it sets on Schedule HI and is 
OpOl 1 . Initiation is delayed to a cycle 6, without having continuous effect on actuation of other 
arbitration. It sets in a cycle 5 and other solutions to this 2nd collision are OpOl 1 . It leaves as it is, it sets 
in a cycle 6 simultaneously, and is field DEC.SP.SE. It is OpOl 1 by setting it as 'AL 1 . It is delaying a 
result. OpOl 1 The result of having been delayed is field DEC.DM.A.SE in a cycle 7. It is Op014 by 
setting it as 'SP'. It can teach a DM.A storage unit at the time of initiation. 

He is trying for the specific register SP to maintain the engine-performance level of a true multiport 
machine, while the single collision of two values always arises in the predetermined cycle of arbitration, 
avoiding back track actuation. This will say that a functional unit can read only one value from a storage 
unit in each cycle, and means that one side of a collision result is always required in front of another 
side. Because of basis, it is OpOl 1 . A result is Op004. It is Op004 when it turns out that it is what is 
demanded more compared with a result. The number of two cycles, i.e., arbitration, can carry out cycle 
delay of the result until it will become more urgent than a collision result. In all cases, the need for the 
continuous re-scheduling of other actuation is avoided. 

The number of the functional units which should be included, and selection of a type can be used 
sufficiently freely. Each unit of the specific gestalt of drawing 1 can combine all of the group of these 
functions with a functional unit more common multiple-purpose type to the maximum flexibility in 
scheduling, while being specified to some extent as performing the group of a specific function or a 
function. Since sufficient pipeline is permitted, I hear that it is desirable to have a certain amount of 
latency to all actuation of a predetermined unit as for the inconvenience of this type of unit, and it has it. 
This means that it is necessary to slow down actuation quicker than (for it to be like fixed generating) to 
the minimum working speed. In order to perform for example, a floating point multiplication function, a 
barrel shift function, or an I/O function, it is necessary to enable it to use the functional unit type 
specified more to it also for a predetermined application, since the unit same type as being used by the 
usual microprocessor is contained in the processor of this invention this contractor ~ good - 
arrangement — that is, it is made to suit easily 

Memory interface circuitry which has exception situation recovery Drawing 6 shows the configuration 
of a suitable memory interface circuitry to use it as a data memory unit DM of the processor of drawing 
1 -4. Signal DM.G (guard bit), DM.A (address), and DM.I (write-in data) are received from the imitation 
multipoint memory 30 (refer to drawing 1 ). A storage management unit (MMU) supplies signal DM.0 
to a store circuit 30 through a circuit 100 while receiving these signals through a certain (not shown to 
drawing 1 ) exception situation recovery circuit 100. op code signal DM.OP (read-out/writing) is 
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received from an instruction register 46 (refer to drawing 1 ). MMU Large-scale off chip main memory 
MEM The interface to receive is offered and virtual memory address assignment well known by this 
contractor is performed especially in itself. 

Signal DM. A and DM.I are not affected in the exception situation recovery circuit 100, but guard-bit 
DM.G is MMU as signal DM.G' simultaneously. Before making it pass, it is AND in the protection 
enforcement signal GF. It calculates. When Signal GF is in the condition of logic' 1' (truth), signal 
DM.G' is equal to signal DM.G. However, '0' (false) is forced DM.G' when Signal GF is logic '0' (false). 
The protection enforcement signal GF is guard-bit AL.G of other functional units of the processor of 
drawing 1 , and RG.G. And BR.G (it does not explain), 
It is AND with the same approach. It calculates. 

The exception situation recovery circuit 100 is a multiplexer 102, and the record/regenerative circuit 
RRC. It contains. Circuit RRC Playback mode signal REP A multiplexer 102 is controlled. REP - 
between logic '0' and a multiplexer - MMU from ~ generated signal DM.O' is transmitted to the 
imitation multiport store circuit 30 as signal DM.O. however, REP Between logic T, and 
record/regenerative circuit RRC from - generated playback data signal DREP is transmitted to a store 
circuit 30 instead of DM.O'. therefore, playback mode signal REP the recording mode which is 
DM.O=DM.O' - DM.O=DREP it is — it distinguishes from a playback mode. Record / regenerative 
circuit RRC Signal DREC=DM[ from the output section of a multiplexer 102 ] .0 It takes out. 
(It characterized by signal GF= T and REP= '0*) In normal operation, substantially, the broken-line box 
100 becomes transparent and becomes result DM.G -DM.G and DM.O=DM.O\ MMU It is each cycle 
and is main memory MEM. It is condition DM.G=, while reading value DM.O from location DM.A or 
being dependent on op code DM. OP. It rules over truly and is main memory MEM. The demand which 
writes value DM.I in location DM.A is received. MMU Record / regenerative circuit RRC Memory 
MEM from - the event signal ETV (ETV=1) showing the availability of value DM.O by which reading 
appearance was carried out — record / regenerative circuit RRC It supplies. 

MMU TLB explained later again Supplying the loss signal TLBM to the input section of the OR gate 
104, the output section of the OR gate is the exception situation detecting signal EXDET. Sequencer 
SEQ And it conveys to the instruction generating pipeline IIP (refer to drawing 1 ). The input section of 
another side of the OR gate 104 is used for receiving an exceptional event start signal from other 
functional units. 

Signal EXDET The reset input section (R) of the set-reset flip-flop 106 which synchronizes once for 
every machine cycle is supplied. Instruction generating pipeline IIP Exception situation handling signal 
EXH And checkpoint signal CPT It is made to generate and they are record / regenerative circuit RRC 
about these signals. It supplies. Exception situation handling signal EXH Supplying the set input section 
(S) of a flip-flop 106, the output signal (Q) of a flip-flop 106 constitutes the protection enforcement 
signal GF simultaneously. 

Drawing 7 is record / regenerative circuit RRC. It is shown more in a detail. Time amount counter TIC 
which synchronizes once for every machine cycle Checkpoint signal CPT It receives and the time 
amount signal TREC is generated. The time amount signal TREC is connected with Signal DREC, and 
it is the event memory ECM. The data input section (DIN) is supplied. Signal DM.O is the event 
memory EVM, if it assumes that it is width of face of 8 bits to the time amount signal TREC, since it 
has 32 bits. A data input has width of face of 40 bits. The time amount signal TREC is the checkpoint 
signal CPT and the event signal EVT again. And exception situation handling signal EXH It is the event 
count logical circuit EVC like. It is supplied. Logical circuit EVC It is the event memory EVM by each 
of address signal ADDR and the write-in enable signal WE. It controls. 

Event memory EVM It has 40-bit data output DOUT, and is a logical circuit EVC about the 8 bits event 
time amount signal TREP of this output, and a 32-bit playback data signal. And latch 108 is supplied, 
respectively, latch 108 — logical circuit EVC from — the following event signal NXT supplied it 
synchronizes ~ having — simultaneous — the latch output section .— playback data signal 
EREP is supplied to a multiplexer 102. 

The configuration of the storage management unit to sufficiently known virtual-memory actuation and 
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detailed explanation of operation are not required for this contractor here because of an understanding of 
this invention. If it summarizes, it will be MMU of this gestalt. A translation lookaside buffer (TB or 
TLB) is included, and this translation lookaside buffer is Memory MEM about the virtual address 
(DM.A). Two or more entries mapped in a physical address are included. It is TLB only about the subset 
of the address which may exist. It can map in predetermined time amount and is TLB. When address 
DM.A can be mapped by the existing entry, the demand of read-out to address DM.A or writing can be 
fitted only in the inside of data memory unit latency (two cycle of this gestalt). When address DM.A 
cannot map by the existing entry, this needs to constitute the exception situation and needs to interrupt 
program execution. This so-called TLB About a loss, it is new (the existing entry is deleted) TLB. Since 
it is solvable by creating an entry, it is TLB. A loss is called important [ "un-important" ] or a "possible 
[ recovery ]" exception situation. MMU Signal TLBM=1 is used and it is TLB. While expressing 
generating of a loss, exception situation handling and a recovery process are started. 
Example of activation which has the exception situation The schedule of drawing 8 is already carried out 
to two or more very long instruction word (refer to IW of drawing 2 ), and it shows in linearity the 
program fragmentation of the 2nd example memorized by program counter location PC=0 and refer to 1 
or 2 or less. All actuation by which the schedule was carried out is not illustrated and it is main memory 
MEM in the case of activation. Five actuation which carries out 1 increment to the value memorized by 
the position is shown. Fragmentation does not use intentionally the value which already exists in the 
imitation multiport store circuit 30. Therefore, the machine condition of the relation in PC=0 exists in 
the register of the data memory unit DM (main memory MEM) and the register unit RG thoroughly. 
Furthermore, the schedule of the actuation which is not completed before PC=0 is not carried out in 
front of PC=0. Therefore, the checkpoint instruction which expressed to PC=0 by the asterisk '*' in 
drawing 8 can be shown. When flows of control "branch" in a checkpoint, the condition of this latter 
always is not required and this means that a medium history (the pipeline's content) is always the same 
at the time of checkpoint attainment. 

If an entry is carried out to the fragmentation of PC=0 in the 2nd example, the pointer to the storage 
location of relation will be memorized by the register 0 of the register unit RG. Of operation 'rdreg' 
started by PC=0 copies the pointer from here to a temporary location tl of the imitation multiport store 
circuit 30. In this case, tl is received by PC(RG latency is 1) =1. Of operation 'read tl 1 started by PC=1 
reads the value memorized in the storage location of relation in the 2nd temporary location t2. In this 
case, t2 is received by PC(DM latency is 2) =3. In the meantime, of operation 'constant(l)' started by 
PC=2 loads a delta value 1 to the 3rd (CO latency is 0) temporary location t3. Of operation 'aplus t2 t3' 
started by PC=3 permutes the sum of the value from memory t2 and an increment t3 by the 4th 
temporary location t4. In this case, t4 (AL latency is 1), 

It receives by PC=4. Finally, of operation 'write tl t4' started by PC=4 returns to the primary-storage 
location also showing a temporary location tl , and writes in the value t4 which carried out the 
increment. In this case, t4 reaches PC(TLB assuming that there is no loss) =6. 

In case wave drawing 9 performs program fragmentation of the 2nd example of drawing 8 by PC=5, it 
shows the exception situation recovery process using drawing 6 in the case of assuming that the 
exception situation (for example, TBL loss) is detected, and the circuit of 7. Top line'PC- of drawing 9 
expresses the program enumerated data of a configuration of generating from an IIP pipeline in the 
continuous cycle of a processor started by PC=B which has branching to checkpoint PC=0. The 
following, Signal CPT, TREC (time amount signal enumerated data are shown), EVT, EXDET, and GF 
and EXH And REP A wave is expressed. It sets at the time of wave-like initiation, and they are Signals 
CPT and EVT, EXDET, and EXH. And REP All are set to '0' and GF is set to '1'. These express normal 
operation (recording mode which does not deal with the exception situation), since the number signal 
TREC of hour meters is a former (not shown) checkpoint - counting - it becomes a cycle. 
A checkpoint instruction is expressed to program memory by setting bit CP=T as the instruction word 
(referring to drawing 2 ) of relation. With this gestalt, it is set as instruction PC=B which starts the 
branching actuation to a checkpoint rather than attached to checkpoint instruction itself in Bit CP. This 
means gives advantageous warning of each checkpoint of 5 cycle, and relaxation timing is checkpoint 
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signal CPT=1 by branching delay. The circuit which is generated or answers a checkpoint by the option 
is restrained. Naturally, it is signal CPT=1 when it is FALSE (BR.G=FALSE and/or GF=0) by which 
branching actuation is guarded. It does not generate. It is the checkpoint signal CPT to the last of cycle 
PC=L of the last of branching delay. At the time of initiation of the cycle which attached 'C, it sets up 
by the instruction generating pipeline (in logic "1"). In this case, it is referred to as PC=0 at the time of 
the beginning. 

When a return needs to be carried out so that the exception situation may be followed, it is checkpoint 
signal CPT=1. Sequencer SEQ PC value of a checkpoint is made to memorize, checkpoint signal CPT=1 
[ moreover, ] Time amount counter TIC in record / regenerative circuit RRC ( drawing 7 ) zero - 
resetting (TREC=0) — an event — counting - logical circuit EVC Address ADDR - zero — setting up - 
event memory EVM It clears effectively. The increment of the number signal TREC of hour meters is 
carried out in the following cycle, respectively, and it acts as a rule of thumb of the time amount 
progress from a checkpoint. Event signal EVT Value by which reading appearance was carried out from 
memory (being the gestalt of signal DM.0=DM.O') 

It is MMU, when it can use first and the cycle (PC=3) which attached T is received. It sets up. event 
signal ETV=1 answering - an event — counting - logical circuit EVC Event memory EVM the write-in 
enable signal WE - setting up - record data signal DREC=DM.O=DM.O f ~ the "time amount stamp" 
of the gestalt of the number signal TREC of hour meters -- current - event memory EVM equal to 3 The 
first location is made to memorize. According to this, the increment of the event memory address ADDR 
is carried out for preparation of storage of the following event. 

It sets to E (PC=5) and is exception situation detecting-signal EXDET=1. It is TLB as shown. A loss or 
other exception situations occur. Thereby, a flip-flop 106 is made to generate guard enforcement signal 
GF=0, and activation of meantime PC=6 and PC=7 is continued. By these two cycles, actuation of the 
arbitration containing for example, of operation 'write tl' is started before an exception situation, and 
those activation is completed. Therefore, these and other actuation which are started before the 
exception situation is detected change a machine condition from the condition which exists in a 
checkpoint of having been specified good. Even if it starts new actuation in these two cycles, the 
actuation started by protection enforcement signal GF=0 after the exception situation arises does not 
affect a machine condition, consequently the machine condition is exact at time amount R. 
It is used by a sequencer and the instruction generating pipeline, and the two-cycle delay between E and 
R is TLB. In the case of the exception situation of a loss, it is MMU. TLB Transfer control to the 
exception situation handling routine (PC=X) which sets up a new entry is performed. 
This delay is shorter than the usual branching delay, and makes min disadvantageous profit of time 
amount to processing of the exception situation. Signal EXH=1 which answers carrying out a return in 
the guard enforcement signal GF T, and generates the persistence time of an exception situation 
manipulation routine in time amount R It is marked and is the time amount counter TIC. Counting is 
stopped (TREC=7). 

An exception situation manipulation routine is operated using the subsegment of addressable memory 
eternally and directly. By this means and suitable programming, exception situation manipulation- 
routine itself guarantees not generating absolutely the exception situation which is not important, 
reference Colwell which explained this means in itself, for example, already, etc. ~ from - it is known 
at this contractor. The actuation as which an exception situation manipulation routine is required is the 
approach of sufficiently known [ contractor / this ], and is MMU. It depends and other publications of 
these are not required for an understanding of the exception situation recovery indicated here. Exception 
situation handling can also be performed by specific hardware, without preparing clearly by the 
programmer. For example, 680XO(s) It sets to a processor and is TLB about specific "table Wolk" 
hardware. It prepares in order to process the exception situation of loss. 

It goes to termination of an exception situation manipulation routine, and is Sequencer SEQ. pC value of 
the latest checkpoint (this example PC= 0) is recovered, and he is the instruction generating pipeline IIP 
about it. The instruction word IW which inputted, consequently was memorized by PC=0 is loaded to 
the time amount which attached C by drawing 9 at an instruction register 46, and this starts rerun of the 
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program fragmentation of the 2nd example started from checkpoint PC=0. About the checkpoint signal 
CPT, they are record / regenerative circuit RRC. Time amount counter TIC It answers resetting the time 
amount signal TREC to zero, and sets up at the time of this cycle, and counting of the cycle from a 
checkpoint is also started with this checkpoint signal, an event - counting — logical circuit ECV the 
value DREC which reset the address ADDR of event memory to zero, and was recorded by T as a result, 
and time amount stamp TREP=3 — event memory EVM It can use in the data output section DOUT. 
rerun - hitting - an event - counting — logical circuit EVC Playback selection-signal REP=1 It sets up. 
therefore, the multiplexer 102 — MMU from - the data path of generated signal DM.CV — being 
interrupted — latch 108 — minding — event memory EVM from — generated playback data signal DREP 
is constituted. It sets to time amount T f of rerun, and is Circuit EVC. It detects that between the time 
amount stamps TREP remembered to be the present time amount signals TREC is equal, and is Signal 
NXT. It uses and the value recorded on the time amount E under original activation is inputted into latch 
108. Therefore, value DM.0 supplied to the imitation multiport store circuit 30 by time amount T' is the 
memory MEM of the original time amount T. It becomes a value and does not become the delta value 
memorized by of operation 'write tl t4' under original activation of program fragmentation here, 
therefore, the value for which it depends on a former exact machine condition instead of the value 
depending on the actual machine condition bad specified by activation among the playback mode — 
event memory EVM from - it turns out that it is supplied. Setting to PC=6 under rerun, of operation 
'write tl t4' is only Memory MEM during original activation. Overwrite of the same value is only carried 
out to the memorized delta value. Since only an exact input is received from the machine condition 
under rerun (as), the exact output to a machine condition is guaranteed. It is suitable that the reason for 
explaining with the header according to the following individuals performs read-out and no circuit 
actuation into a playback mode actually. 

It is Circuit EVC if it returns to drawing 8 and the example of activation of 9. It is a regenerative signal 
REP just before time amount R' (PC=8) rerun exceeding the point that activation was interrupted in 
order to process the exception situation. It resets. It is the event memory EVM about the event (data 
transfer) of everything but arbitration by this. Exact recovery is attained when it is one or more of other 
events of arbitration in which it can add to the event (up to Event T) recorded before, therefore rerun can 
be required from the same checkpoint. 

Main memory MEM [ in / in the case of a larger-scale processor, two or more data storage units DM are 
formed, and / each cycle ] The receiving multi-access is made possible, the gestalt of such [ naturally ] 
operation ~ record/regenerative circuit — main memory MEM from — each data path is required. Since 
the whole processor synchronizes and operates, it is Circuit RRC. The existing component TIC, for 
example, a time amount counter, It is sharable between two or the circuit beyond it. A storage unit 
besides these is the same main memory tooth space MEM from the memory according to other 
individuals, in order to increase the range of parallel processing. Additional parts are offered ideally. 
Since the register unit RG also includes a programmer visible machine condition, the description of 
exception situation recovery is also established. However, Hwu of the reference already quoted using the 
backup copy register since the amount of the condition of Unit RG was dramatically small With the 
usual approach which it reaches and was indicated for the example by patt, it is storage **** about the 
condition in each checkpoint. 

Therefore, since it is brief, the exception situation recovery description of a register unit is not indicated 
further, when the number of registers cannot hold economically [ and ] to perfect backup copy as an 
alternative, this contractor understands that a register (even — or **) unit output-data path (RG.O) can 
include record/regenerative circuit. [ very ] 

Furthermore, in the gestalt of another operation, content itself of an imitation multiport memory can be 
included in the definition of a machine condition, and each memory unit (26 drawing 1 ) establishes the 
exception situation recovery description with the gestalt of a backup storage unit, or a 
record/regenerative circuit in this case according to the size of the storage unit concerned. 
Record/regenerative circuit without a time amount stamping Drawing 10 shows other examples of 
record/regenerative circuit of drawing 7 , and is the time amount counter TIC in this case. It does not 
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prepare but is the event memory EVM about a time amount stamp value (TREC). It is not made to 
memorize using a data entry. This is the event (generated from MMU- drawing 6 ) signal EVT to time 
amount T' under playback which was carried out to the time amount T under activation of the beginning 
of a checkpoint instruction although the control circuit of the circumference of it was clearly simplified 
while decreasing the size of event memory. It is dependent on generating, therefore, other examples - 
setting « record/regenerative circuit — event signal EVT in a recording mode "point which reproduces 
the value newly remembered that the event signal in a playback mode is expressed while remembering 
that it is expressed by the always new value DREC - coming — it can enter --" "point - coming - 
taking out — " - that is, a FIFO memory is offered effectively. 

Naturally, it is the event storage control circuit EVC. While maintaining a number of a value of counting 
recorded on accuracy from the checkpoint and generating Address ADDR, in case rerun passes the point 
which activation interrupted for the former attempt, the change to a recording mode from a playback 
mode is performed. 

Naturally, a system designer is the starting value and the event signal EVT of Address ADDR to the 
easy example of drawing 1 0 . It is necessary to satisfy not reproducing the value corresponding to 
[ number / of manifestation ] accuracy in both a recording mode and a playback mode which has already 
recorded the playback value on accuracy. 

Constraint of the memory access of a playback mode Deformation of the gestalt of the above-mentioned 
implementation can be required of a actual system. For example, as already explained, instruction^ <- 
read tV is rerun in rerun of the program fragmentation ( drawing 8 and 9) of the 2nd example, 
instruction' write tl t4' is permitted after that, and the value written in with the same instruction during 
original activation is processed, and overwrite is carried out. While the inaccurate value by which 
reading appearance was carried out from the address tl is disregarded, and while the value by which 
overwrite was carried out to the address tl is exact, it is desirable to forbid the exact read-out actuation 
and the write-in actuation in a playback mode generally. 

As for access actuation of the storage location, this reason has often carried out the trigger of other 
events, therefore a repeat [****/ un-] is performed. Actually, an I/O device like UART for serial 
communication is often addressed as the mere storage location ("memory by which the map was carried 
out"). However, in such a case, a value is transmitted twice to a remote device by actuation which writes 
the same value in the storage location twice. Instead of receiving character string'DATA', a remote 
device receives 'DAATA 1 . Similarly, signal transmission of the WORD is usually carried out at UART 
permitted by the processor, and the following alphabetic character which is not the same produces read- 
out actuation of the alphabetic character received from the UART register by the repeat of read-out. 
Therefore, when a remote device transmits train'DATA', a processor receives 'DAA\ 
I/O which carried out the memory map Even when equipment does not exist, repeat memory read-out 
actuation can carry out the trigger of the event which a virtual storage system does not expect. For 
example, in activation of instruction^ <-read tl 1 of the program fragmentation of the 2nd example, 
while accessing the storage location tl at accuracy, data conversion is the event memory EVM. It is 
assumed that it was recorded. Furthermore, the page of the memory which includes a location tl while 
processing the exception situation is TBL. It is assumed that it is thrown away in order to pack "new 
refer to the page." If rerun begins from a checkpoint, the value demanded is the event memory EVM. It 
is TLB, if storage read-out actuation in which it succeeded around the last time amount is performed 
even if recorded. The exception situation of a loss arises. Clearly, it is very desirable to have not asked 
for this situation, and to arrange also to this reason, so that memory access may be controlled to the 
validity in a playback mode. 

Deletion of a control pipeline By giving the information of the timing of each type of actuation more 
detailed than an easy latency value to a compiler, an architect understands that a "control pipeline" can 
be deleted or it can be made to decrease depending on the case. Many delay circuits are deleted from a 
processor with this technique, and, thereby, a physical circuit dimension is decreased. Instead, a 
compiler uses the detailed information, and carries out the schedule of the different bit field relevant to 
desired actuation according to an individual, consequently each bit, i.e., a bit field, is not loaded to an 
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instruction register 46, and it is not provided for a functional circuit to the machine cycle demanded 
actually. 

For example, in a predetermined cycle (PC=i), it is assumed that he wants to start the actuation which 
should be performed with the functional unit which has the latency of a four cycle. At the actual 
example of the processor explained here, it is the guard selection field (-- .G.RA) to a unit PC=i+OD It 
can be made to be able to generate, the op code can be generated in PC=i +1, and one or the argument 
selection field beyond it (— .-.RA) can be generated in PC=i +2 or PC=i +3. Destination control field 
(DEC. -- .-.SE and DEC. ~ .-.WA) are not required to the last cycle of a latency period, but can be 
generated in cycle PC=i +4. 

Although such a means is known theoretically, actual already becoming complicated to a known 
exception situation processor is known. That reason is that the clear answer which asks and receives 
"whether the activation front stirrup completed this actuation after activation" does not exist any longer, 
this complexity - 1990 year 1 [ for example, ] The CYDRA-5 Departmental Supercomputer' of 
COMPUTER of the moon — Rau etc. — reference is made. Refer to 'Clearly, handling an exception-' 
which follow 26 pages from 25 -page item initiation especially. Therefore, an architect prevents from 
deleting control pipeline arrangement, therefore takes care not to decrease a circuit dimension by these 
considerations. 

To it, according to the guard device of the equipment explained here, the clear answer to this question 
can be permitted, therefore an architect can delete control pipeline arrangement. If the exception 
situation occurs, when generating the guard selection field to pending actuation before the guard 
enforcement signal GF becomes active, while the exception situation is detected, it is necessary to 
consider each pending actuation completed only in that case. In this case, when generating other fields, 
it is important for the architect of the equipment which determines to remove control pipeline 
arrangement freely, and its scheduling compiler. 

Since the timing of generating of the guard selection field is different about detailed activation of 
actuation of each different type, the architect of a processor needs to take care so that the time amount 
arrangement of the change between a playback mode and a recording mode may be appropriately carried 
out to each functional unit. 

This contractor understands the approach of using the above-mentioned record/playback device circuit, 
its above, and other deformation to exception situation recovery of wide range data processing system 
other than the VLIW processor explained here. 



[Translation done.] 
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CLAIMS 



[Claim(s)] 

1. In Operating Data Processing System Controlled by Sequence of Instruction Which Forms Program 
(a) Said program is performed while specifying the number of the checkpoints of said program. A step 
including the data transfer of the 1st type depending on the machine condition that this activation was 
specified with the data transfer in said system, (b) The step which answers the exception situation 
produced during activation of this step (a), and interrupts said program execution, (c) In the data- 
processing-system actuation approach equipped with the step which reruns said program from the latest 
checkpoint after solution of said interruption Said step (a) includes record of said 1st type of each data 
transfer from a checkpoint recently [ said ]. Said step (c) The data-processing-system actuation approach 
characterized by including the playback of the recorded data which synchronized with said rerun instead 
of actual data transfer. 

2. Said step (c) is the data-processing-system actuation approach of the claim 1 publication characterized 
by including the restart of usual activation of said program while recording said 1st type of data transfer 
according to said step (a), when said rerun reaches the point that said activation was interrupted for said 
step (b). 

3. Claim 1 characterized by enabling it to complete those activation with instruction which transmitted 
control of said system to exception situation manipulation routine from said program, was delayed in 
transfer of said control as a result of interruption of activation of said step (b), and was generated before 
said exception situation, or data-processing-system actuation approach of two publications. 

4. Data-processing-system actuation approach of claim 3 publication characterized by forbidding 
actuation started during said delay before transfer of control to said exception situation manipulation 
routine so that it may not have effect of arbitration in said specified machine condition. 

5. For said 1st type of data transfer, said specified machine condition is the data-processing-system 
actuation approach of the claim 1 publication characterized by including transmitting read-out of a value 
and its value from the location of said data memory to the processing component of said system 
including the content of the data memory of said system. 

6. Claim 1 which makes said specified machine condition subset of all machine conditions of said 
processing system, and is characterized by specifying said checkpoint at point of said program in case 
only said specified machine condition relates to next instruction of said program, or data-processing- 
system actuation approach given in two. 

7. It is Data Processing System. A Means to Generate Sequence of Instruction Which Forms Program on 
the occasion of Activation by this System, A means to identify the checkpoint in said program While 
performing said program, data transfer of the 1st type is performed. The 1st data path for which each of 
such data transfer depends on the machine condition that said system was specified, A means to answer 
the exception situation, and to interrupt said program execution In data processing system equipped with 
a means to rerun a program from the recently checkpoint after solution of said exception situation Said 
system A means to connect with said data path and to record each of said 1st type of data transfer from a 
checkpoint recently [ said ] While interrupting said account data path of rerun Nakamae Data processing 
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system characterized by having further a means to reproduce the recorded data transfer instead of a6Wl 
data transfer synchronizing with said rerun. 

8. Data processing system of claim 7 publication characterized by including means to reboot said 
playback means if said rerun reaches point that said activation was interrupted before while restoring 
said data path. 

9. Claim 7 characterized by including means to make it make those activation complete by actuation 
which said exception situation was answered, control of said system was transmitted to exception 
situation manipulation routine from said program, and transfer of this control was delayed, and was 
started before said activation, or data processing system given in eight. 

10. Data processing system of the claim 9 publication characterized by forbidding the actuation started 
during the delay before a transfer of the control to said exception situation manipulation routine, and 
including a means to make it such actuation not affect said specified machine condition. 

1 1 . For said data path, said specified machine condition is the data processing system of the claim 7 
publication characterized by connecting the read-out port of said data memory to the processing 
component of said system including the content of the data memory of said system. 

12. Said data memory is the data processing system of the claim 1 1 publication characterized by having 
a virtual storage system. 

13. The claim 7 characterized by including two or more functional units which are controlled by each 
field of each instruction and perform each actuation to juxtaposition, or data processing system given in 
eight. 

14. it be the data processing system of the claim 13 publication which carry out [ also include a means 
forbid the actuation which be complete before detection of said activation , and ] as the description by 
prepare each guard signal which prevent those actuation that affect said specified machine condition in 
spite of generating of a suitable instruction facultative in some of said functional units [ at least ] , and 
said system answer detection of said activation , and force each guard signal of said functional unit in 
non-conditions . 

15. In Performing Predetermined Actuation, Control Said Processing System by Field of One or More 
Instructions. Decrease control pipeline arrangement and at least one guard signal which prevents the 
predetermined actuation which affects said system at said specified machine condition facultative is 
prepared. Said field includes the guard selection field which chooses one guard signal from the guard 
signal which can use plurality to predetermined actuation. By referring to whether said guard selection 
field was generated before said interruption in contrast with the field of other arbitration The claim 7 
characterized by specifying actuation so that it may generate before the interruption produced according 
to the exception situation, or data processing system given in eight. 

16. Data processing system of the claim 14 publication characterized by controlling the read-out 
actuation and the write-in actuation to said data memory during rerun of an instruction. 

In Said Record Means List, 1 7. Said Interruption and Playback Means The multiplexer connected to said 
data path by having the output section linked to the 1st input section linked to said storage means, and 
said processing means, The event memory circuit which has the data input section linked to said data 
path, the data output section linked to the 2nd input section of said multiplexer, and the address input 
section, While connecting said storage means to said processing means by said multiplexer among a (i) 
recording mode While making said event memory memorize each value transmitted to said processing 
means from said storage means and connecting the data output section of said event memory to it by 
said multiplexer among the (ii) playback mode at said processing means The claim 7 characterized by 
having the control circuit which reproduces the value memorized by said recording mode before by said 
event memory, or data processing system given in eight. 



[Translation done.] 
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